================= CRYOSPARCW ======= 2021-08-08 18:42:25.615602 =========
Project P17 Job J821
Master jptitan Port 39002
===========================================================================
========= monitor process now starting main process
MAINPROCESS PID 315826
MAIN PID 315826
refine.newrun cryosparc_compute.jobs.jobregister
========= monitor process now waiting for main process
========= sending heartbeat
========= sending heartbeat
========= sending heartbeat
========= sending heartbeat
========= sending heartbeat
***************************************************************
Running job J821 of type nonuniform_refine_new
Running job on hostname %s jptitan
Allocated Resources : {'fixed': {'SSD': True}, 'hostname': 'jptitan', 'lane': 'default', 'lane_type': 'default', 'license': True, 'licenses_acquired': 1, 'slots': {'CPU': [0, 1, 2, 3], 'GPU': [0], 'RAM': [0, 1, 2]}, 'target': {'cache_path': '/scratch', 'cache_quota_mb': None, 'cache_reserve_mb': 10000, 'desc': None, 'gpus': [{'id': 0, 'mem': 11554717696, 'name': 'GeForce RTX 2080 Ti'}, {'id': 1, 'mem': 11554717696, 'name': 'GeForce RTX 2080 Ti'}, {'id': 2, 'mem': 11554324480, 'name': 'GeForce RTX 2080 Ti'}], 'hostname': 'jptitan', 'lane': 'default', 'monitor_port': None, 'name': 'jptitan', 'resource_fixed': {'SSD': True}, 'resource_slots': {'CPU': [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63], 'GPU': [0, 1, 2], 'RAM': [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15]}, 'ssh_str': 'jparmache@jptitan', 'title': 'Worker node jptitan', 'type': 'node', 'worker_bin_path': '/data/software/cryosparc/cryosparc2_worker/bin/cryosparcw'}}
HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty
HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty
HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty
HOST 
ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 
256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 
24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256========= sending heartbeat grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 
block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12)========= sending heartbeat 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid 
size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (342, 1, 2007, 81) 218 block size 256 grid size (342, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (342, 1, 672, 44) 894 block size 256 grid size (342, 42, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (342, 1, 224, 24) 2362 block size 256 grid size (342, 14, 1) global compute_resid_pow with (342, 1, 80, 12) 2362 block size 256 grid size (342, 5, 1) global compute_resid_pow with (342, 1, 32, 8) 2362 block size 256 grid size (342, 2, 1) global compute_resid_pow with (342, 1, 16, 4) 2362 block size 256 grid size (342, 1, 1) global compute_resid_pow with (342, 1, 8, 4) 2362 block size 128 grid size (342, 8, 1) global compute_resid_pow with (342, 1, 19, 21) 2362 block size 256 grid size 
(342, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256========= sending heartbeat grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 
256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362========= sending heartbeat block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 
80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (343, 1, 2007, 81) 218 block size 256 grid size (343, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (343, 1, 672, 44) 894 block size 256 grid size (343, 42, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (343, 1, 224, 24) 2362 block size 256 grid size (343, 14, 1) global compute_resid_pow with (343, 1, 80, 12) 2362 block size 256 grid size (343, 5, 1) global compute_resid_pow with (343, 1, 32, 8) 2362 block size 256 grid size (343, 2, 1) global compute_resid_pow with (343, 1, 16, 4) 2362 block size 256 grid size (343, 1, 1) global compute_resid_pow with (343, 1, 8, 4) 2362 block size 128 grid size (343, 8, 1) global compute_resid_pow with (343, 1, 19, 21) 2362 block size 256 grid size (343, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using 
cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256========= sending heartbeat grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with 
(500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 
42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 
128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12)========= sending heartbeat ========= sending heartbeat 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 
32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (342, 1, 2007, 81) 218 block size 256 grid size (342, 126, 1) global compute_resid_pow with (342, 1, 672, 44) 894 block size 256 grid size (342, 42, 1) global compute_resid_pow with (342, 1, 224, 24) 2362 block size 256 grid size (342, 14, 1) global compute_resid_pow with (342, 1, 80, 12) 2362 block size 256 grid size (342, 5, 1) global compute_resid_pow with (342, 1, 32, 8) 2362 block size 256 grid size (342, 2, 1) global compute_resid_pow with (342, 1, 16, 4) 2362 block size 256 grid size (342, 1, 1) global compute_resid_pow with (342, 1, 8, 4) 2362 block size 128 grid size (342, 8, 1) global compute_resid_pow with (342, 1, 19, 21) 2362 block size 256 grid size (342, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using 
cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block 
size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 
81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size 
(500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 
80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362========= sending heartbeat ========= sending heartbeat block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (343, 1, 2007, 81) 218 block size 256 grid size (343, 126, 1) global compute_resid_pow with (343, 1, 672, 44) 894 block size 256 grid size (343, 42, 1) global compute_resid_pow with (343, 1, 224, 24) 2362 block size 256 grid size (343, 14, 1) global compute_resid_pow with (343, 1, 80, 12) 2362 block size 256 grid size (343, 5, 1) global compute_resid_pow with (343, 1, 32, 8) 2362 block size 256 grid size (343, 2, 1) global compute_resid_pow with (343, 1, 16, 4) 2362 block size 256 grid size (343, 1, 1) global compute_resid_pow with (343, 1, 8, 4) 2362 block size 128 grid size (343, 8, 1) global compute_resid_pow with (343, 1, 19, 21) 2362 block size 256 grid size (343, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: FSC No-Mask... 0.143 at 79.371 radwn. 0.5 at 46.463 radwn. Took 2.862s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 90.746 radwn. 0.5 at 72.681 radwn. Took 3.192s. FSC Loose Mask... ========= sending heartbeat 0.143 at 98.646 radwn. 
0.5 at 80.861 radwn. Took 13.045s. FSC Tight Mask... ========= sending heartbeat 0.143 at 103.754 radwn. 0.5 at 91.549 radwn. Took 11.556s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with 
(500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 
block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 
44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 
256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 16896 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (249, 1, 2007, 81) 218 block size 256 grid size (249, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (249, 1, 672, 44) 894 block size 256 grid size (249, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (249, 1, 224, 24) 3604 block size 256 grid size (249, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (249, 1, 80, 12) 14456 block size 256 grid size (249, 5, 1) global compute_resid_pow with (249, 1, 32, 8) 16896 block size 256 grid size (249, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (249, 1, 16, 4) 16896 block 
size 256 grid size (249, 1, 1) global compute_resid_pow with (249, 1, 8, 4) 16896 block size 128 grid size (249, 8, 1) global compute_resid_pow with (249, 1, 19, 21) 16896 block size 256 grid size (249, 2, 1) global compute_resid_pow with (249, 1, 19, 21) 16896 block size 256 grid size (249, 2, 1) exception in cufft.Plan.__del__: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 
1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid 
size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) 
global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow 
with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 
24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 
8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block 
size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size 
(500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 
80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (249, 1, 2007, 81) 218 block size 256 grid size (249, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (249, 1, 672, 44) 894 block size 256 grid size (249, 42, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (249, 1, 224, 24) 3604 block size 256 grid size (249, 14, 1) global compute_resid_pow with (249, 1, 80, 12) 14456 block size 256 grid size (249, 5, 1) global compute_resid_pow with (249, 1, 32, 8) 16896 block size 256 grid size (249, 2, 1) global compute_resid_pow with (249, 1, 16, 4) 16896 block size 256 grid size (249, 1, 1) global compute_resid_pow with (249, 1, 8, 4) 16896 block size 128 grid size (249, 8, 1) global compute_resid_pow with (249, 1, 19, 21) 16896 block size 256 grid size (249, 2, 1) global compute_resid_pow with (249, 1, 19, 21) 16896 block size 256 grid size (249, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 
1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 
block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid 
size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 
81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (249, 1, 2007, 81) 218 block size 256 grid size (249, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (249, 1, 672, 44) 894 block size 256 grid size (249, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (249, 1, 224, 24) 3604 block size 256 grid size (249, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (249, 1, 80, 12) 14456 block size 256 grid size (249, 5, 1) global compute_resid_pow with (249, 1, 32, 8) 16896 block size 256 grid size (249, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (249, 1, 16, 4) 
16896 block size 256 grid size (249, 1, 1) global compute_resid_pow with (249, 1, 8, 4) 16896 block size 128 grid size (249, 8, 1) global compute_resid_pow with (249, 1, 19, 21) 16896 block size 256 grid size (249, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (249, 1, 19, 21) 16896 block size 256 grid size (249, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow 
with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 
80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 
256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 
4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (249, 1, 2007, 81) 218 block size 256 grid size (249, 126, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (249, 1, 672, 44) 894 block size 256 grid size (249, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 16896 block size 256 grid size (500, 1, 1) global compute_resid_pow with (249, 1, 224, 24) 3604 block size 256 grid size (249, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 16896 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (249, 1, 80, 12) 14456 block size 256 grid size (249, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 16896 block size 256 grid size (500, 2, 1) global compute_resid_pow with (249, 1, 32, 8) 16896 block size 256 grid size (249, 2, 1) global compute_resid_pow with (249, 1, 16, 4) 16896 block size 256 grid size (249, 1, 1) global compute_resid_pow with (249, 1, 8, 4) 16896 block size 128 grid size (249, 8, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (249, 1, 19, 21) 16896 block size 256 grid size (249, 2, 1) global compute_resid_pow with (249, 1, 19, 21) 16896 block size 256 grid size (249, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: FSC No-Mask... 0.143 at 98.232 radwn. 0.5 at 79.045 radwn. Took 3.116s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 102.680 radwn. 0.5 at 89.919 radwn. Took 2.924s. FSC Loose Mask... ========= sending heartbeat 0.143 at 108.509 radwn. 0.5 at 97.557 radwn. Took 14.539s. FSC Tight Mask... ========= sending heartbeat 0.143 at 115.377 radwn. 0.5 at 102.873 radwn. Took 10.982s. 
========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with 
(500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 
126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (264, 1, 2007, 81) 218 block size 256 grid size (264, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (264, 1, 672, 44) 894 block size 256 grid size (264, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (264, 1, 224, 24) 3604 block size 256 grid size (264, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 
20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (264, 1, 80, 12) 14456 block size 256 grid size (264, 5, 1) global compute_resid_pow with (264, 1, 32, 8) 20904 block size 256 grid size (264, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (264, 1, 16, 4) 20904 block size 256 grid size (264, 1, 1) global compute_resid_pow with (264, 1, 8, 4) 20904 block size 128 grid size (264, 8, 1) global compute_resid_pow with (264, 1, 19, 21) 20904 block size 256 grid size (264, 2, 1) global compute_resid_pow with (264, 1, 19, 21) 20904 block size 256 grid size (264, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 
1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size 
(500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) 
global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 
1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (264, 1, 2007, 81) 218 block size 256 grid size (264, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (264, 1, 672, 44) 894 block size 256 grid size (264, 42, 1) global compute_resid_pow with (264, 1, 224, 24) 3604 block size 256 grid size (264, 14, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (264, 1, 80, 12) 14456 block size 256 grid size (264, 5, 1) global compute_resid_pow with (264, 1, 32, 8) 20904 block size 256 grid size (264, 2, 1) global compute_resid_pow with (264, 1, 16, 4) 20904 block size 256 grid size (264, 1, 1) global compute_resid_pow with (264, 1, 8, 4) 20904 block size 128 grid size (264, 8, 1) global 
compute_resid_pow with (264, 1, 19, 21) 20904 block size 256 grid size (264, 2, 1) global compute_resid_pow with (264, 1, 19, 21) 20904 block size 256 grid size (264, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size========= sending heartbeat (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 
32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) 
global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow 
with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 
24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 
grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block 
size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size 
(500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4)========= sending heartbeat 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size========= sending 
heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid 
size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 
81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 
672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size 
(500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (264, 1, 2007, 81) 218 block size 256 grid size (264, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (264, 1, 672, 44) 894 block size 256 grid size (264, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (264, 1, 224, 24) 3604 block size 256 grid size (264, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (264, 1, 80, 12) 14456 block size 256 grid size (264, 5, 1) global compute_resid_pow with (264, 1, 32, 8) 20904 block size 256 grid size (264, 2, 1) global compute_resid_pow with (264, 1, 16, 4) 20904 block size 256 grid size (264, 1, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (264, 1, 8, 4) 20904 block size 128 grid size (264, 8, 1) global compute_resid_pow with (264, 1, 19, 21) 20904 block size 256 grid size (264, 2, 1) global compute_resid_pow with (264, 1, 19, 21) 20904 block size 256 grid size (264, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using 
cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) ========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block 
size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 
4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global========= sending heartbeat ========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (264, 1, 2007, 81) 218 block size 256 grid size (264, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (264, 1, 672, 44) 894 block size 256 grid size (264, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 20904 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 20904 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (264, 1, 224, 24) 3604 block size 256 grid size (264, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 20904 block size 256 grid size (500, 2, 1) global compute_resid_pow with (264, 1, 80, 12) 14456 block size 256 grid size (264, 5, 1) global compute_resid_pow with (264, 1, 32, 8) 20904 block size 256 grid size (264, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (264, 1, 16, 4) 20904 block size 256 grid size (264, 1, 1) global compute_resid_pow with (264, 1, 8, 4) 20904 block size 128 grid size (264, 8, 1) global compute_resid_pow with (264, 1, 19, 21) 20904 block size 256 grid size (264, 2, 1) global compute_resid_pow with (264, 1, 19, 21) 20904 block size 256 grid size (264, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: FSC No-Mask... 0.143 at 100.097 radwn. 0.5 at 79.916 radwn. Took 2.965s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 103.735 radwn. 0.5 at 94.720 radwn. Took 3.016s. FSC Loose Mask... ========= sending heartbeat 0.143 at 110.458 radwn. 0.5 at 100.327 radwn. Took 13.649s. FSC Tight Mask... ========= sending heartbeat 0.143 at 119.470 radwn. 0.5 at 106.234 radwn. Took 10.038s. 
========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with 
(500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 
block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 
12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 
1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (177, 1, 2007, 81) 218 block size 256 grid size (177, 126, 1) global compute_resid_pow with (500, 1, 
16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (177, 1, 672, 44) 894 block size 256 grid size (177, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (177, 1, 224, 24) 3604 block size 256 grid size (177, 14, 1) global compute_resid_pow with (177, 1, 80, 12) 14456 block size 256 grid size (177, 5, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (177, 1, 32, 8) 22414 block size 256 grid size (177, 2, 1) global compute_resid_pow with (177, 1, 16, 4) 22414 block size 256 grid size (177, 1, 1) global compute_resid_pow with (177, 1, 8, 4) 22414 block size 128 grid size (177, 8, 1) global compute_resid_pow with (177, 1, 19, 21) 22414 block size 256 grid size (177, 2, 1) global compute_resid_pow with (177, 1, 19, 21) 22414 block size 256 grid size (177, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 
21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 
1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block 
size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 
1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 
224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (178, 1, 2007, 81) 218 block size 256 grid size (178, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (178, 1, 672, 44) 894 block size 256 grid size (178, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (178, 1, 224, 24) 3604 block size 256 grid size (178, 14, 1) global compute_resid_pow with (178, 1, 80, 12) 14456 block size 256 grid size (178, 5, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (178, 1, 32, 8) 22414 block size 256 grid size (178, 2, 1) global compute_resid_pow with (178, 1, 16, 4) 22414 block size 256 grid size (178, 1, 1) global compute_resid_pow with (178, 1, 8, 4) 22414 block size 128 grid size (178, 8, 1) global compute_resid_pow with (178, 1, 19, 21) 22414 block size 256 grid size (178, 2, 1) global compute_resid_pow with (178, 1, 19, 21) 22414 block size 256 grid size (178, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 
1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 
block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12)========= sending heartbeat 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size========= sending heartbeat (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 22414 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 
672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size 
(500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size========= sending heartbeat 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 
4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (177, 1, 2007, 81) 218 block size 256 grid size (177, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (177, 1, 672, 44) 894 block size 256 grid size (177, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (177, 1, 224, 24) 3604 block size 256 grid size (177, 14, 1) global compute_resid_pow with (177, 1, 80, 12) 14456 block size 256 grid size (177, 5, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (177, 1, 32, 8) 22414 block size 256 grid size (177, 2, 1) global compute_resid_pow with (177, 1, 16, 4) 22414 block size 256 grid size (177, 1, 1) global compute_resid_pow with (177, 1, 8, 4) 22414 block size 128 grid size (177, 8, 1) global compute_resid_pow with (177, 1, 19, 21) 22414 block size 256 grid size (177, 2, 1) global compute_resid_pow with (177, 1, 19, 21) 22414 block size 256 grid size (177, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 
block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block 
size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) 
global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 
1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (178, 1, 2007, 81) 218 block size 256 grid size (178, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22414 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
22414 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (178, 1, 672, 44) 894 block size 256 grid size (178, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 22414 block size 256 grid size (500, 2, 1) global compute_resid_pow with (178, 1, 224, 24) 3604 block size 256 grid size (178, 14, 1) global compute_resid_pow with (178, 1, 80, 12) 14456 block size 256 grid size (178, 5, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (178, 1, 32, 8) 22414 block size 256 grid size (178, 2, 1) global compute_resid_pow with (178, 1, 16, 4) 22414 block size 256 grid size (178, 1, 1) global compute_resid_pow with (178, 1, 8, 4)========= sending heartbeat 22414 block size 128 grid size (178, 8, 1) global compute_resid_pow with (178, 1, 19, 21) 22414 block size 256 grid size (178, 2, 1) global compute_resid_pow with (178, 1, 19, 21) 22414 block size 256 grid size (178, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: FSC No-Mask... 0.143 at 100.498 radwn. 0.5 at 79.991 radwn. Took 3.263s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 104.611 radwn. 0.5 at 95.587 radwn. Took 2.910s. FSC Loose Mask... ========= sending heartbeat 0.143 at 111.665 radwn. 0.5 at 101.042 radwn. Took 12.390s. FSC Tight Mask... ========= sending heartbeat 0.143 at 120.492 radwn. 0.5 at 107.011 radwn. Took 9.998s. 
========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 
22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 
grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with 
(500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 
block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 
44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 
256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) 
global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 
24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 
grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block 
size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size 
(500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 
22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 
grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4)========= sending heartbeat 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size 
(500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 
1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 
4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 ========= sending heartbeat block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 ========= sending heartbeat block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 
1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 
2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 
126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow 
with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 22810 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 22810 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 22810 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 22810 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 22810 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 
672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size 
(500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size 
(500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow 
with========= sending heartbeat (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size 
(500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (35, 1, 2007, 81) 218 block size 256 grid size (35, 126, 1) global compute_resid_pow with (35, 1, 672, 44) 894 block size 256 grid size (35, 42, 1) global compute_resid_pow with (35, 1, 224, 24) 3604 block size 256 grid size (35, 14, 1) global compute_resid_pow with (35, 1, 80, 12) 14456 block size 256 grid size (35, 5, 1) global compute_resid_pow with (35, 1, 32, 8) 22810 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 16, 4) 22810 block size 256 grid size (35, 1, 1) global compute_resid_pow with (35, 1, 8, 4) 22810 block size 128 grid size (35, 8, 1) global compute_resid_pow with (35, 1, 19, 21) 22810 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 19, 21) 22810 block size 256 grid size (35, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object 
has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 
block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size 
(500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 
grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 
128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 
42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 
4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 
5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 
224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 
128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 
42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) 
global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 22810 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 22810 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 22810 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 22810 block size 256 grid size========= sending heartbeat (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 22810 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using 
cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 
80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 
4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending 
heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 
80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 ========= sending heartbeat block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 
126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 
21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 
1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 
224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 
128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 
42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 22810 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 22810 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 22810 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 22810 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 22810 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 22810 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 22810 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 22810 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 
1, 19, 21) 22810 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: FSC No-Mask... ========= sending heartbeat 0.143 at 106.392 radwn. 0.5 at 96.828 radwn. Took 2.907s. FSC Spherical Mask... 0.143 at 110.085 radwn. 0.5 at 100.105 radwn. Took 3.057s. FSC Loose Mask... ========= sending heartbeat ========= sending heartbeat 0.143 at 120.742 radwn. 0.5 at 106.614 radwn. Took 14.467s. FSC Tight Mask... ========= sending heartbeat 0.143 at 130.390 radwn. 0.5 at 113.272 radwn. Took 11.035s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 129.704 radwn. 0.5 at 113.004 radwn. Took 23.164s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using 
cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 
26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 
grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with 
(500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 
4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global========= sending 
heartbeat compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block 
size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 
grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 
26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 
grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 26424 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 26424 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 26424 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 26424 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 26424 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 
26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1,
32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14,
1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size 
(500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with 
(500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global
compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (35, 1, 2007, 81) 218 block size 256 grid size (35, 126, 1) global compute_resid_pow with (35, 1, 672, 44) 894 block size 256 grid size (35, 42, 1) global compute_resid_pow with (35, 1, 224, 24) 3604 block size 256 grid size (35, 14, 1) global compute_resid_pow with (35, 1, 80, 12) 14456 block size 256 grid size (35, 5, 1) global compute_resid_pow with (35, 1, 32, 8) 26424 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 16, 4) 26424 block size 256 grid size (35, 1, 1) global compute_resid_pow with (35, 1, 8, 4) 26424 block size 128 grid size (35, 8, 1) global compute_resid_pow with (35, 1, 19, 21) 26424 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 19, 21) 26424 block size 256 grid size (35, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 
126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with 
(500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 
21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) 
global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 
12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 
128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size========= sending heartbeat 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid 
size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) 
global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow 
with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 
24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 
grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block 
size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size 
(500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block 
size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 
32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 
1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with 
(500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 
block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 
44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 
256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) 
global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 
24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 
8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block 
size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size 
(500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 26424 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 26424 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 26424 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 26424 block size 256 grid size========= sending heartbeat (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 26424 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size========= sending 
heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size 
(500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 
block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 ========= sending heartbeat block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 ========= sending heartbeat block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 ========= sending heartbeat block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 
1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 
224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 
128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 
42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 
21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 
21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 
1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 
224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 
128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 
42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 
21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block 
size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 26424 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 26424 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 26424 block size 128 grid size (34, 8, 1) 
global compute_resid_pow with (34, 1, 19, 21) 26424 block size 256 grid size (34, 2, 1)
global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1)
global compute_resid_pow with (34, 1, 19, 21) 26424 block size 256 grid size (34, 2, 1)
global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1)
exception in force_free_cufft_plan:
exception in cufft.Plan.__del__:
exception in cufft.Plan.__del__:
global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1)
global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1)
global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1)
exception in cufft.Plan.__del__:
FSC No-Mask... 0.143 at 106.626 radwn. 0.5 at 96.943 radwn. Took 2.384s.
FSC Spherical Mask...
========= sending heartbeat
0.143 at 110.350 radwn. 0.5 at 100.292 radwn. Took 2.952s.
FSC Loose Mask...
========= sending heartbeat
0.143 at 121.970 radwn. 0.5 at 106.753 radwn. Took 12.615s.
FSC Tight Mask...
========= sending heartbeat
0.143 at 131.744 radwn. 0.5 at 113.295 radwn. Took 10.248s.
FSC Noise Sub...
========= sending heartbeat
========= sending heartbeat
0.143 at 131.672 radwn. 0.5 at 113.073 radwn. Took 24.413s.
========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 
block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size 
(500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 
grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 
672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) 
global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow 
with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 
24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 
8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block 
size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size 
(500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 
block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block 
size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 
block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size ========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 
19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global 
compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 27236 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 27236 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 27236 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 27236 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 27236 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 
1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending 
heartbeat (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 
672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size 
(500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (35, 1, 2007, 81) 218 block size 256 grid size (35, 126, 1) global compute_resid_pow with (35, 1, 672, 44) 894 block size 256 grid size (35, 42, 1) global compute_resid_pow with (35, 1, 224, 24) 3604 block size 256 grid size (35, 14, 1) global compute_resid_pow with (35, 1, 80, 12) 14456 block size 256 grid size (35, 5, 1) global compute_resid_pow with (35, 1, 32, 8) 27236 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 16, 4) 27236 block size 256 grid size (35, 1, 1) global compute_resid_pow with (35, 1, 8, 4) 27236 block size 128 grid size (35, 8, 1) global compute_resid_pow with (35, 1, 19, 21) 27236 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 19, 21) 27236 block size 256 grid size (35, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 
224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 
128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 
42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 
126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4)========= sending heartbeat 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 
grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 
128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 
42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 
1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block 
size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 
block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) 
global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 27236 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 27236 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 27236 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 27236 block size 256 grid size========= sending heartbeat (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 27236 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using 
cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 
80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 
1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 
19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) 
global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow 
with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending 
heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 
80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1)
global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8,
4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 ========= sending heartbeat block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1)
global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 ========= sending heartbeat block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 27236 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 27236 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 27236 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 27236 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 27236 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27236 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27236 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 19, 21) 27236 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: FSC No-Mask... 0.143 at 106.683 radwn. 0.5 at 96.963 radwn. Took 3.023s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 110.352 radwn. 0.5 at 100.334 radwn. Took 3.218s. FSC Loose Mask... ========= sending heartbeat ========= sending heartbeat 0.143 at 122.240 radwn. 0.5 at 106.748 radwn. Took 17.092s. FSC Tight Mask... ========= sending heartbeat 0.143 at 132.448 radwn. 0.5 at 113.132 radwn. Took 12.225s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 132.877 radwn. 0.5 at 112.874 radwn. Took 21.536s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using 
cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 
27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 
grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 ========= sending heartbeat block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 ========= sending heartbeat block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 
4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global========= sending 
heartbeat compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block 
size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 
grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 
4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 
grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with 
(500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow 
with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 
44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 
256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 27722 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 27722 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 27722 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 27722 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 27722 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 
32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 
1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size 
(500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (35, 1, 2007, 81) 218 block size 256 grid size (35, 126, 1) global compute_resid_pow with (35, 1, 672, 44) 894 block size 256 grid size (35, 42, 1) global compute_resid_pow with (35, 1, 224, 24) 3604 block size 256 grid size (35, 14, 1) global compute_resid_pow with (35, 1, 80, 12) 14456 block size 256 grid size (35, 5, 1) global compute_resid_pow with (35, 1, 32, 8) 27722 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 16, 4) 27722 block size 256 grid size (35, 1, 1) global compute_resid_pow with (35, 1, 8, 4) 27722 block size 128 grid size (35, 8, 1) global compute_resid_pow with (35, 1, 19, 21) 27722 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 19, 21) 27722 block size 256 grid size (35, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 
1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid 
size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 
27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 
grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 
128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 
42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block 
size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) 
global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow 
with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 
5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 27722 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 27722 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 27722 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 27722 block size 256 grid size========= sending heartbeat (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 27722 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 
1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 
256 grid size (500, 1, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 
block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) 
global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow 
with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 
24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 
8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block 
size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size 
(500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global========= sending heartbeat 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 27722 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 27722 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 27722 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 27722 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 27722 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27722 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27722 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27722 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: FSC No-Mask... 0.143 at 106.671 radwn. 0.5 at 96.964 radwn. Took 2.558s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 110.367 radwn. 0.5 at 100.335 radwn. Took 3.179s. FSC Loose Mask... ========= sending heartbeat 0.143 at 122.406 radwn. 0.5 at 106.739 radwn. Took 13.934s. FSC Tight Mask... ========= sending heartbeat ========= sending heartbeat 0.143 at 132.863 radwn. 0.5 at 113.084 radwn. Took 11.907s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 133.153 radwn. 0.5 at 112.805 radwn. Took 24.037s. 
========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = 
n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. 
x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. 
x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. 
x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. 
self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in force_free_cufft_plan: 'NoneType' object has no attribute 
'handle' exception in cufft.Plan.__del__: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 
4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4)========= sending heartbeat 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 
126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global========= sending heartbeat 
compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 
27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 
32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 
81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block 
size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 27854 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 27854 block size 256 grid size (34, 1, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (34, 1, 8, 4) 27854 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 27854 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 27854 block size 256 grid size (34, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 
19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) 
global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 
256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending 
heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size 
(500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 
32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size 
(500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (35, 1, 2007, 81) 218 block size 256 grid size (35, 126, 1) global compute_resid_pow with (35, 1, 672, 44) 894 block size 256 grid size (35, 42, 1) global compute_resid_pow with (35, 1, 224, 24) 3604 block size 256 grid size (35, 14, 1) global compute_resid_pow with (35, 1, 80, 12) 14456 block size 256 grid size (35, 5, 1) global compute_resid_pow with (35, 1, 32, 8) 27854 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 16, 4) 27854 block size 256 grid size (35, 1, 1) global compute_resid_pow with (35, 1, 8, 4) 27854 block size 128 grid size (35, 8, 1) global compute_resid_pow with (35, 1, 19, 21) 27854 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 19, 21) 27854 block size 256 grid size (35, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty 
exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block 
size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 
80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) 
global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) 
global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 
12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 
128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size 
(500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 27854 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 27854 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 27854 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 27854 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 27854 block size 256 grid size (34, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) exception in 
cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 
4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 
8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 
4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 
224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 
128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 
42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 
grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) ========= sending heartbeat ========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 27854 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 27854 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 27854 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 27854 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 27854 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27854 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 
1, 8, 4) 27854 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27854 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: FSC No-Mask... 0.143 at 106.794 radwn. 0.5 at 96.984 radwn. Took 3.379s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 110.385 radwn. 0.5 at 100.390 radwn. Took 3.082s. FSC Loose Mask... ========= sending heartbeat 0.143 at 122.309 radwn. 0.5 at 106.836 radwn. Took 12.831s. FSC Tight Mask... ========= sending heartbeat 0.143 at 133.103 radwn. 0.5 at 113.039 radwn. Took 10.525s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 133.084 radwn. 0.5 at 113.063 radwn. Took 21.561s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat 
========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in cufft.Plan.__del__: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' 
min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with 
(500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 
block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 
24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 
grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 
1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 
21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27818 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block 
size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 
21)========= sending heartbeat 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with 
(500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 
block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid 
size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block 
size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 
126, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 
80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block 
size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size 
(500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 27818 block size 256 grid size (34, 2, 1) global compute_resid_pow with 
(34, 1, 16, 4) 27818 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 27818 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 27818 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 27818 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) 
global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with 
(500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 
block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid 
size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 
grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid 
size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 
1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500,
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 
14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 
21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block 
size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 
8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global========= sending 
heartbeat compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size 
(500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 
grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size 
(500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 
1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 
1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 
27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 
672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block 
size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size 
(500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (35, 1, 2007, 81) 218 block size 256 grid size (35, 126, 1) global compute_resid_pow with (35, 1, 672, 44) 894 block size 256 grid size (35, 42, 1) global compute_resid_pow with (35, 1, 224, 24) 3604 
block size 256 grid size (35, 14, 1) global compute_resid_pow with (35, 1, 80, 12) 14456 block size 256 grid size (35, 5, 1) global compute_resid_pow with (35, 1, 32, 8) 27818 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 16, 4) 27818 block size 256 grid size (35, 1, 1) global compute_resid_pow with (35, 1, 8, 4) 27818 block size 128 grid size (35, 8, 1) global compute_resid_pow with (35, 1, 19, 21) 27818 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 19, 21) 27818 block size 256 grid size (35, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 
80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 
19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 
4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 
80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 
256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 
80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1,
19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8,
4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) 
global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 
block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid 
size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow 
with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block 
size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global 
compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 
3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid 
size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 
256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 
1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global 
compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow 
with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 27818 block size 256 grid size (34, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 27818 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 27818 block size 128 grid size (34, 8, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (34, 1, 19, 21) 27818 block size 256 grid size (34, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 27818 block size 256 grid size (34, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 
'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 
21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block 
size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 
grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 
27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 
80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) 
global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 
27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 
1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow 
with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 
27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 
256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block 
size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size 
(500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global 
compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow 
with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 ========= sending heartbeat block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with 
(500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 
27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 
grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 ========= sending heartbeat block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block 
size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global 
compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with 
(500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global 
compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with 
(500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 
block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid 
size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) 
global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 
2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global 
compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow 
with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 
218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 
80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 
256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 
14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global 
compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat ========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block 
size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size 
(500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 27818 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 27818 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 27818 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 27818 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 27818 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 27818 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 27818 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 27818 block size 256 grid size (500, 2, 1) exception in 
cufft.Plan.__del__:
FSC No-Mask... 0.143 at 106.807 radwn. 0.5 at 96.981 radwn. Took 2.854s.
FSC Spherical Mask... 0.143 at 110.389 radwn. 0.5 at 100.375 radwn. Took 3.375s.
FSC Loose Mask... 0.143 at 122.348 radwn. 0.5 at 106.886 radwn. Took 15.166s.
========= sending heartbeat
========= sending heartbeat
FSC Tight Mask... 0.143 at 132.920 radwn. 0.5 at 113.003 radwn. Took 10.285s.
========= sending heartbeat
FSC Noise Sub... 0.143 at 133.018 radwn. 0.5 at 112.572 radwn. Took 23.734s.
========= sending heartbeat
========= sending heartbeat
---- Computing FSC with mask 2.00 to 6.00
FSC No-Mask... 0.143 at 106.807 radwn. 0.5 at 96.981 radwn. Took 2.179s.
========= sending heartbeat
FSC Spherical Mask... 0.143 at 110.389 radwn. 0.5 at 100.375 radwn. Took 3.083s.
FSC Loose Mask... 0.143 at 122.348 radwn. 0.5 at 106.886 radwn. Took 10.500s.
========= sending heartbeat
FSC Tight Mask... 0.143 at 137.861 radwn. 0.5 at 120.411 radwn. Took 10.521s.
========= sending heartbeat
FSC Noise Sub... 0.143 at 134.286 radwn. 0.5 at 114.194 radwn. Took 22.341s.
========= sending heartbeat
========= sending heartbeat
========= sending heartbeat
---- Computing FSC with mask 2.25 to 7.00
FSC No-Mask... 0.143 at 106.807 radwn. 0.5 at 96.981 radwn. Took 2.160s.
FSC Spherical Mask... 0.143 at 110.389 radwn. 0.5 at 100.375 radwn. Took 3.070s.
FSC Loose Mask... 0.143 at 122.348 radwn. 0.5 at 106.886 radwn. Took 10.505s.
========= sending heartbeat
FSC Tight Mask... 0.143 at 135.460 radwn. 0.5 at 118.894 radwn. Took 10.457s.
========= sending heartbeat
FSC Noise Sub... 0.143 at 134.160 radwn. 0.5 at 115.219 radwn. Took 23.233s.
========= sending heartbeat
========= sending heartbeat
========= sending heartbeat
---- Computing FSC with mask 2.50 to 8.00
FSC No-Mask... 0.143 at 106.807 radwn. 0.5 at 96.981 radwn. Took 1.931s.
FSC Spherical Mask... 0.143 at 110.389 radwn. 0.5 at 100.375 radwn. Took 2.773s.
========= sending heartbeat
FSC Loose Mask... 0.143 at 122.348 radwn. 
0.5 at 106.886 radwn. Took 9.728s. FSC Tight Mask... ========= sending heartbeat 0.143 at 134.445 radwn. 0.5 at 117.552 radwn. Took 10.394s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat 0.143 at 134.028 radwn. 0.5 at 115.634 radwn. Took 21.327s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: *************************************************************** /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. 
self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. 
self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) ========= main process now complete. ========= monitor process now complete.